Norwegian University of Science and Technology Technical Report IDI-TR-1/2003 Algorithms for Granularity Reduction in Temporal Document Databases
نویسنده
چکیده
With rapidly decreasing storage costs temporal document databases is now a viable solution in many contexts. However, storing an ever growing database can still be too costly, and as a consequence it is desirable to be able to physically delete old versions. Traditionally, this has been performed by an operation called vacuuming, where the oldest versions are physically deleted (or migrated from secondary storage to cheaper tertiary storage). However, in temporal document databases it is more appropriate to remove intermediate versions instead of removing the oldest versions. We call this operation granularity reduction. In this paper we describe six approaches to granularity reduction, and discuss advantages and disadvantages of these approaches. Three of the approaches have been implemented into the V2 temporal document database system, and in this context we discuss the cost of applying the approaches.
منابع مشابه
Norwegian University of Science and Technology Technical report IDI-TR-11/2002 Supporting Temporal Text-Containment Queries
In temporal document databases and temporal XML databases, temporal text-containment queries are a potential performance bottleneck. In this paper we describe how to manage documents and index structures in such databases in way that makes temporal text-containment querying feasible. We describe and discuss different index structures that can improve such queries. Three of the alternatives have...
متن کاملNorwegian University of Science and Technology Technical report IDI-TR-X/2002, last revised: 2002-09-02 V2: A Database Approach to Temporal Document Management
The advent of large amounts of data on the web has closed the gap between the document storage and database communities. In this paper, this work is continued by the description of the foundations for temporal document databases. We describe the V2 temporal document database, which supports storage, retrieval, and querying of temporal documents. We describe functionality and operations/operator...
متن کاملNorwegian University of Science and Technology Technical report IDI-TR-10/2002 Design, Implementation, and Performance of the V2 Temporal Document Database System
The advent of large amounts of data on the web has closed the gap between the document storage and the database communities. In this paper, this work is continued by the description of the foundations for temporal document databases. We describe functionality and operations/operators to be supported by such systems, and more specifically we describe the architecture for management of temporal d...
متن کاملNorwegian University of Science and Technology Technical Report IDI-TR-09/2007 Semantic-Based Association Rule Mining of Temporal Document Collections
In many contexts today we have documents available in a number of versions. In addition to explicit knowledge that can be queried/searched in documents, these documents also contain implicit knowledge that can be found by text mining. In this paper we will study association rule mining of temporal document collections, and extend our previous work by 1) performing mining based on semantics as w...
متن کاملNorwegian University of Science and Technology Technical Report IDI-TR-05/2008 PROQID: Partial restarts of queries in distributed databases
In a number of application areas, distributed database systems can be used to provide persistent storage of data while providing efficient access for both local and remote data. With an increasing number of sites (computers) involved in a query, the probability of failure at query time increases. Recovery has previously only focused on database updates while query failures have been handled by ...
متن کامل